Overview
Dataset statistics
| Number of variables | 16 |
|---|---|
| Number of observations | 14000 |
| Missing cells | 7705 |
| Missing cells (%) | 3.4% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 1.5 MiB |
| Average record size in memory | 116.0 B |
Variable types
| DateTime | 1 |
|---|---|
| Numeric | 8 |
| Categorical | 5 |
| Text | 2 |
Alerts
| Alert | Type |
|---|---|
| month is highly overall correlated with quarter | High correlation |
| quarter is highly overall correlated with month | High correlation |
| residents is highly overall correlated with water_consumption | High correlation |
| water_consumption is highly overall correlated with residents | High correlation |
| apartment_type has 426 (3.0%) missing values | Missing |
| temperature has 441 (3.1%) missing values | Missing |
| income_level has 426 (3.0%) missing values | Missing |
| amenities has 5997 (42.8%) missing values | Missing |
| appliance_usage has 415 (3.0%) missing values | Missing |
| timestamp has unique values | Unique |
| guests has 9658 (69.0%) zeros | Zeros |
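The overview figures can be cross-checked against the per-variable alerts: the five "Missing" counts should add up to the reported number of missing cells, and dividing by the total number of cells should give the reported percentage. A minimal sketch of that arithmetic follows; the variable names are illustrative and not part of the report.

```python
# Per-variable missing counts, copied from the "Missing" alerts in this report.
missing_by_variable = {
    "apartment_type": 426,
    "temperature": 441,
    "income_level": 426,
    "amenities": 5997,
    "appliance_usage": 415,
}

n_variables = 16        # "Number of variables"
n_observations = 14000  # "Number of observations"

total_missing = sum(missing_by_variable.values())
total_cells = n_variables * n_observations
missing_pct = 100 * total_missing / total_cells

print(total_missing)          # 7705
print(f"{missing_pct:.1f}%")  # 3.4%
```

Both results agree with the "Missing cells" rows of the dataset statistics, so no other variable contributes missing values.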
Reproduction
| Analysis started | 2025-03-24 06:38:01.482429 |
|---|---|
| Analysis finished | 2025-03-24 06:38:04.977127 |
| Duration | 3.49 seconds |
| Software version | ydata-profiling v4.15.1 |
Variables
timestamp
Date
Unique 
| Distinct | 14000 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 109.5 KiB |
| Minimum | 2002-01-01 00:00:00 |
|---|---|
| Maximum | 2014-10-11 08:00:00 |
| Invalid dates | 0 |
| Invalid dates (%) | 0.0% |
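The reported range is consistent with one record every 8 hours. The 8-hour spacing is an assumption read off the sample rows at the end of this report (00:00, 08:00, 16:00); it is not a statistic the profiler states. Under that assumption, a short check of the last timestamp might look like this:

```python
from datetime import datetime, timedelta

start = datetime(2002, 1, 1, 0, 0, 0)  # reported minimum
n_records = 14000                       # reported distinct timestamps
step = timedelta(hours=8)               # assumed cadence (from the sample rows)

# With n evenly spaced records, the last one is (n - 1) steps after the first.
last = start + (n_records - 1) * step
print(last)  # 2014-10-11 08:00:00
```

The result matches the reported maximum, which suggests the series has no gaps or repeated timestamps.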
residents
Real number (ℝ)
High correlation 
| Distinct | 14 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.0784286 |
| Minimum | -99 |
|---|---|
| Maximum | 5 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 280 |
| Negative (%) | 2.0% |
| Memory size | 109.5 KiB |
Quantile statistics
| Minimum | -99 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 2 |
| median | 3 |
| Q3 | 4 |
| 95-th percentile | 5 |
| Maximum | 5 |
| Range | 104 |
| Interquartile range (IQR) | 2 |
Descriptive statistics
| Standard deviation | 9.2416653 |
|---|---|
| Coefficient of variation (CV) | 4.4464676 |
| Kurtosis | 77.677125 |
| Mean | 2.0784286 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | -8.5652378 |
| Sum | 29098 |
| Variance | 85.408378 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 3 | 4965 | 35.5% |
| 2 | 3074 | 22.0% |
| 5 | 2547 | 18.2% |
| 4 | 2525 | 18.0% |
| 1 | 609 | 4.3% |
| -99 | 39 | 0.3% |
| -11 | 36 | 0.3% |
| -55 | 34 | 0.2% |
| -22 | 32 | 0.2% |
| -33 | 32 | 0.2% |
| Other values (4) | 107 | 0.8% |
| Value | Count | Frequency (%) |
| -99 | 39 | 0.3% |
| -88 | 31 | 0.2% |
| -77 | 28 | 0.2% |
| -66 | 23 | 0.2% |
| -55 | 34 | 0.2% |
| -44 | 25 | 0.2% |
| -33 | 32 | 0.2% |
| -22 | 32 | 0.2% |
| -11 | 36 | 0.3% |
| 1 | 609 | 4.3% |
| Value | Count | Frequency (%) |
| 5 | 2547 | 18.2% |
| 4 | 2525 | 18.0% |
| 3 | 4965 | 35.5% |
| 2 | 3074 | 22.0% |
| 1 | 609 | 4.3% |
| -11 | 36 | 0.3% |
| -22 | 32 | 0.2% |
| -33 | 32 | 0.2% |
| -44 | 25 | 0.2% |
| -55 | 34 | 0.2% |
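The negative values (-11 through -99) pull the mean well below the median and inflate the standard deviation. The sketch below rebuilds the reported mean from the value counts and then recomputes it with the negatives excluded. Treating the negatives as placeholder codes rather than real head counts is an assumption; the report does not say what they represent.

```python
# Value counts for `residents`, copied from the frequency tables above.
counts = {
    3: 4965, 2: 3074, 5: 2547, 4: 2525, 1: 609,
    -11: 36, -22: 32, -33: 32, -44: 25, -55: 34,
    -66: 23, -77: 28, -88: 31, -99: 39,
}

def weighted_mean(value_counts):
    """Mean of a variable given as a {value: count} mapping."""
    n = sum(value_counts.values())
    return sum(v * c for v, c in value_counts.items()) / n

raw_mean = weighted_mean(counts)

# Assumption: negative values are placeholder codes, not real head counts.
valid = {v: c for v, c in counts.items() if v > 0}
clean_mean = weighted_mean(valid)
n_flagged = sum(c for v, c in counts.items() if v < 0)

print(round(raw_mean, 7))    # 2.0784286
print(n_flagged)             # 280
print(round(clean_mean, 2))  # 3.24
```

The raw mean and the flagged count match the "Mean" and "Negative" rows above. With the negatives set aside, the mean moves to about 3.24, close to the reported median of 3.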
apartment_type
Categorical
Missing 
| Distinct | 7 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 426 |
| Missing (%) | 3.0% |
| Memory size | 109.5 KiB |
Length
| Max length | 8 |
|---|---|
| Median length | 4 |
| Mean length | 5.3083837 |
| Min length | 4 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Studio |
|---|---|
| 2nd row | Cottage |
| 3rd row | 1BHK |
| 4th row | Cottage |
| 5th row | 2BHK |
Common Values
| Value | Count | Frequency (%) |
| 2BHK | 3157 | |
| 1BHK | 3019 | |
| Bungalow | 1925 | |
| 3BHK | 1909 | |
| Cottage | 1824 | |
| Studio | 1186 | 8.5% |
| Detached | 554 | 4.0% |
| (Missing) | 426 | 3.0% |
Words
| Value | Count | Frequency (%) |
| 2bhk | 3157 | |
| 1bhk | 3019 | |
| bungalow | 1925 | |
| 3bhk | 1909 | |
| cottage | 1824 | |
| studio | 1186 | 8.7% |
| detached | 554 | 4.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| B | 10010 | |
| H | 8085 | |
| K | 8085 | |
| t | 5388 | 7.5% |
| o | 4935 | 6.8% |
| a | 4303 | 6.0% |
| g | 3749 | 5.2% |
| 2 | 3157 | 4.4% |
| u | 3111 | 4.3% |
| 1 | 3019 | 4.2% |
| Other values (12) | 18214 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 72056 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| B | 10010 | |
| H | 8085 | |
| K | 8085 | |
| t | 5388 | 7.5% |
| o | 4935 | 6.8% |
| a | 4303 | 6.0% |
| g | 3749 | 5.2% |
| 2 | 3157 | 4.4% |
| u | 3111 | 4.3% |
| 1 | 3019 | 4.2% |
| Other values (12) | 18214 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 72056 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| B | 10010 | |
| H | 8085 | |
| K | 8085 | |
| t | 5388 | 7.5% |
| o | 4935 | 6.8% |
| a | 4303 | 6.0% |
| g | 3749 | 5.2% |
| 2 | 3157 | 4.4% |
| u | 3111 | 4.3% |
| 1 | 3019 | 4.2% |
| Other values (12) | 18214 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 72056 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| B | 10010 | |
| H | 8085 | |
| K | 8085 | |
| t | 5388 | 7.5% |
| o | 4935 | 6.8% |
| a | 4303 | 6.0% |
| g | 3749 | 5.2% |
| 2 | 3157 | 4.4% |
| u | 3111 | 4.3% |
| 1 | 3019 | 4.2% |
| Other values (12) | 18214 |
temperature
Real number (ℝ)
Missing 
| Distinct | 2490 |
|---|---|
| Distinct (%) | 18.4% |
| Missing | 441 |
| Missing (%) | 3.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 22.566559 |
| Minimum | 10 |
|---|---|
| Maximum | 35 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 109.5 KiB |
Quantile statistics
| Minimum | 10 |
|---|---|
| 5-th percentile | 11.16 |
| Q1 | 16.34 |
| median | 22.58 |
| Q3 | 28.85 |
| 95-th percentile | 33.69 |
| Maximum | 35 |
| Range | 25 |
| Interquartile range (IQR) | 12.51 |
Descriptive statistics
| Standard deviation | 7.2164468 |
|---|---|
| Coefficient of variation (CV) | 0.31978498 |
| Kurtosis | -1.198012 |
| Mean | 22.566559 |
| Median Absolute Deviation (MAD) | 6.26 |
| Skewness | -0.023018785 |
| Sum | 305979.98 |
| Variance | 52.077105 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 20.77 | 14 | 0.1% |
| 24.98 | 13 | 0.1% |
| 25.57 | 13 | 0.1% |
| 22.34 | 13 | 0.1% |
| 21.6 | 13 | 0.1% |
| 22.15 | 13 | 0.1% |
| 30.36 | 13 | 0.1% |
| 26.86 | 13 | 0.1% |
| 15.59 | 13 | 0.1% |
| 27.87 | 13 | 0.1% |
| Other values (2480) | 13428 | |
| (Missing) | 441 | 3.1% |
| Value | Count | Frequency (%) |
| 10 | 4 | < 0.1% |
| 10.01 | 7 | |
| 10.02 | 5 | |
| 10.03 | 5 | |
| 10.04 | 4 | < 0.1% |
| 10.05 | 11 | |
| 10.06 | 2 | < 0.1% |
| 10.07 | 2 | < 0.1% |
| 10.08 | 7 | |
| 10.09 | 6 |
| Value | Count | Frequency (%) |
| 35 | 3 | |
| 34.99 | 3 | |
| 34.98 | 7 | |
| 34.97 | 5 | |
| 34.96 | 3 | |
| 34.95 | 2 | < 0.1% |
| 34.94 | 3 | |
| 34.93 | 7 | |
| 34.92 | 3 | |
| 34.91 | 3 |
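Several of the statistics listed for temperature are derived from the others by simple formulas: the coefficient of variation is the standard deviation divided by the mean, the variance is the squared standard deviation, and the IQR and range are differences of quantiles. A small sketch that recomputes them from the reported values (variable names are illustrative):

```python
# Primary statistics for `temperature`, copied from the tables above.
std, mean = 7.2164468, 22.566559
q1, q3 = 16.34, 28.85
minimum, maximum = 10, 35

cv = std / mean                  # coefficient of variation
variance = std ** 2              # variance is the squared standard deviation
iqr = q3 - q1                    # interquartile range
value_range = maximum - minimum  # range

print(round(cv, 4), round(variance, 3), round(iqr, 2), value_range)
# 0.3198 52.077 12.51 25
```

These agree with the reported CV (0.31978498), variance (52.077105), IQR (12.51) and range (25) up to rounding.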
humidity
Text
| Distinct | 4515 |
|---|---|
| Distinct (%) | 32.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 109.5 KiB |
Length
| Max length | 5 |
|---|---|
| Median length | 5 |
| Mean length | 4.905 |
| Min length | 4 |
Unique
| Unique | 1495 |
|---|---|
| Unique (%) | 10.7% |
Sample
| 1st row | 46.61 |
|---|---|
| 2nd row | 66.11 |
| 3rd row | 60.86 |
| 4th row | 50.58 |
| 5th row | 52.25 |
| Value | Count | Frequency (%) |
| 51.69 | 13 | 0.1% |
| 49.32 | 12 | 0.1% |
| 53.07 | 12 | 0.1% |
| 55.48 | 11 | 0.1% |
| 49.19 | 11 | 0.1% |
| 62.88 | 11 | 0.1% |
| 55.05 | 11 | 0.1% |
| 55.43 | 11 | 0.1% |
| 55.07 | 11 | 0.1% |
| 60.5 | 11 | 0.1% |
| Other values (4500) | 13886 |
Most occurring characters
| Value | Count | Frequency (%) |
| . | 13619 | |
| 5 | 9172 | |
| 4 | 7815 | |
| 6 | 6909 | |
| 3 | 5335 | 7.8% |
| 7 | 4639 | 6.8% |
| 2 | 4243 | 6.2% |
| 8 | 4210 | 6.1% |
| 1 | 4127 | 6.0% |
| 9 | 4097 | 6.0% |
| Other values (84) | 4504 | 6.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 68670 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| . | 13619 | |
| 5 | 9172 | |
| 4 | 7815 | |
| 6 | 6909 | |
| 3 | 5335 | 7.8% |
| 7 | 4639 | 6.8% |
| 2 | 4243 | 6.2% |
| 8 | 4210 | 6.1% |
| 1 | 4127 | 6.0% |
| 9 | 4097 | 6.0% |
| Other values (84) | 4504 | 6.6% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 68670 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| . | 13619 | |
| 5 | 9172 | |
| 4 | 7815 | |
| 6 | 6909 | |
| 3 | 5335 | 7.8% |
| 7 | 4639 | 6.8% |
| 2 | 4243 | 6.2% |
| 8 | 4210 | 6.1% |
| 1 | 4127 | 6.0% |
| 9 | 4097 | 6.0% |
| Other values (84) | 4504 | 6.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 68670 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| . | 13619 | |
| 5 | 9172 | |
| 4 | 7815 | |
| 6 | 6909 | |
| 3 | 5335 | 7.8% |
| 7 | 4639 | 6.8% |
| 2 | 4243 | 6.2% |
| 8 | 4210 | 6.1% |
| 1 | 4127 | 6.0% |
| 9 | 4097 | 6.0% |
| Other values (84) | 4504 | 6.6% |
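humidity is profiled as Text even though its sample values look numeric, and the character table lists 84 other characters besides digits and the decimal point. That suggests some entries do not parse as numbers. A minimal sketch of a tolerant conversion follows; the helper name and the malformed example strings are hypothetical.

```python
import math

def to_float_or_none(raw):
    """Parse a humidity string; return None when it is not a finite number."""
    try:
        value = float(raw)
    except (TypeError, ValueError):
        return None
    return value if math.isfinite(value) else None

# The first three entries come from the report's Sample table; the rest are
# hypothetical malformed inputs used only for illustration.
samples = ["46.61", "66.11", "60.86", "5x.2a", "", None]
parsed = [to_float_or_none(s) for s in samples]
print(parsed)  # [46.61, 66.11, 60.86, None, None, None]
```

Counting the `None` results after such a conversion would show how many humidity entries are actually malformed, which the text profile alone does not report.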
water_price
Real number (ℝ)
| Distinct | 210 |
|---|---|
| Distinct (%) | 1.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.65792357 |
| Minimum | -99 |
|---|---|
| Maximum | 3 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 272 |
| Negative (%) | 1.9% |
| Memory size | 109.5 KiB |
Quantile statistics
| Minimum | -99 |
|---|---|
| 5-th percentile | 1.04 |
| Q1 | 1.32 |
| median | 1.63 |
| Q3 | 2.1125 |
| 95-th percentile | 2.82 |
| Maximum | 3 |
| Range | 102 |
| Interquartile range (IQR) | 0.7925 |
Descriptive statistics
| Standard deviation | 8.7657763 |
|---|---|
| Coefficient of variation (CV) | 13.323396 |
| Kurtosis | 79.572013 |
| Mean | 0.65792357 |
| Median Absolute Deviation (MAD) | 0.36 |
| Skewness | -8.7155616 |
| Sum | 9210.93 |
| Variance | 76.838834 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1.58 | 157 | 1.1% |
| 1.7 | 153 | 1.1% |
| 1.55 | 149 | 1.1% |
| 1.62 | 148 | 1.1% |
| 1.63 | 148 | 1.1% |
| 1.65 | 146 | 1.0% |
| 1.6 | 141 | 1.0% |
| 1.67 | 140 | 1.0% |
| 1.71 | 140 | 1.0% |
| 1.72 | 139 | 1.0% |
| Other values (200) | 12539 |
| Value | Count | Frequency (%) |
| -99 | 26 | |
| -88 | 33 | |
| -77 | 31 | |
| -66 | 33 | |
| -55 | 27 | |
| -44 | 34 | |
| -33 | 28 | |
| -22 | 33 | |
| -11 | 27 | |
| 1 | 48 |
| Value | Count | Frequency (%) |
| 3 | 20 | 0.1% |
| 2.99 | 39 | |
| 2.98 | 51 | |
| 2.97 | 41 | |
| 2.96 | 42 | |
| 2.95 | 43 | |
| 2.94 | 46 | |
| 2.93 | 31 | |
| 2.92 | 35 | |
| 2.91 | 31 |
period_consumption_index
Real number (ℝ)
| Distinct | 450 |
|---|---|
| Distinct (%) | 3.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.1528899 |
| Minimum | -0.13078231 |
|---|---|
| Maximum | 2.3523113 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 2 |
| Negative (%) | < 0.1% |
| Memory size | 109.5 KiB |
Quantile statistics
| Minimum | -0.13078231 |
|---|---|
| 5-th percentile | 0.83 |
| Q1 | 0.97 |
| median | 1.15 |
| Q3 | 1.33 |
| 95-th percentile | 1.47 |
| Maximum | 2.3523113 |
| Range | 2.4830936 |
| Interquartile range (IQR) | 0.36 |
Descriptive statistics
| Standard deviation | 0.22904708 |
|---|---|
| Coefficient of variation (CV) | 0.19867212 |
| Kurtosis | 0.9489001 |
| Mean | 1.1528899 |
| Median Absolute Deviation (MAD) | 0.18 |
| Skewness | -0.063877745 |
| Sum | 16140.459 |
| Variance | 0.052462566 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1.47 | 225 | 1.6% |
| 1.43 | 224 | 1.6% |
| 1.05 | 216 | 1.5% |
| 1.46 | 214 | 1.5% |
| 0.87 | 213 | 1.5% |
| 1.21 | 213 | 1.5% |
| 0.92 | 212 | 1.5% |
| 0.91 | 212 | 1.5% |
| 1.03 | 211 | 1.5% |
| 1.27 | 209 | 1.5% |
| Other values (440) | 11851 |
| Value | Count | Frequency (%) |
| -0.1307823083 | 1 | |
| -0.01401255893 | 1 | |
| 0.0271364282 | 1 | |
| 0.07065422985 | 1 | |
| 0.1249019401 | 1 | |
| 0.1282957768 | 1 | |
| 0.1367372194 | 1 | |
| 0.1446200871 | 1 | |
| 0.1586693434 | 1 | |
| 0.166320506 | 1 |
| Value | Count | Frequency (%) |
| 2.35231127 | 1 | |
| 2.16208798 | 1 | |
| 2.153225826 | 1 | |
| 2.152694911 | 1 | |
| 2.139341172 | 1 | |
| 2.133695183 | 1 | |
| 2.128998887 | 1 | |
| 2.126050447 | 1 | |
| 2.118159918 | 1 | |
| 2.116545335 | 1 |
income_level
Text
Missing 
| Distinct | 420 |
|---|---|
| Distinct (%) | 3.1% |
| Missing | 426 |
| Missing (%) | 3.0% |
| Memory size | 109.5 KiB |
Length
| Max length | 12 |
|---|---|
| Median length | 6 |
| Mean length | 6.9846766 |
| Min length | 3 |
Unique
| Unique | 416 |
|---|---|
| Unique (%) | 3.1% |
Sample
| 1st row | Low |
|---|---|
| 2nd row | Upper Middle |
| 3rd row | Middle |
| 4th row | Middle |
| 5th row | Middle |
| Value | Count | Frequency (%) |
| middle | 9289 | |
| upper | 3966 | |
| low | 2276 | 13.0% |
| rich | 1593 | 9.1% |
| 7 | 2 | < 0.1% |
| | 2 | < 0.1% |
| b | 2 | < 0.1% |
| x | 2 | < 0.1% |
| npc<e | 1 | < 0.1% |
| qzrsg | 1 | < 0.1% |
| Other values (406) | 406 | 2.3% |
Most occurring characters
| Value | Count | Frequency (%) |
| d | 18596 | |
| e | 13287 | |
| i | 10898 | |
| M | 9310 | |
| l | 9309 | |
| p | 7959 | |
| r | 3990 | 4.2% |
| U | 3987 | 4.2% |
| (space) | 3966 | 4.2% |
| o | 2298 | 2.4% |
| Other values (85) | 11210 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 94810 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| d | 18596 | |
| e | 13287 | |
| i | 10898 | |
| M | 9310 | |
| l | 9309 | |
| p | 7959 | |
| r | 3990 | 4.2% |
| U | 3987 | 4.2% |
| (space) | 3966 | 4.2% |
| o | 2298 | 2.4% |
| Other values (85) | 11210 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 94810 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| d | 18596 | |
| e | 13287 | |
| i | 10898 | |
| M | 9310 | |
| l | 9309 | |
| p | 7959 | |
| r | 3990 | 4.2% |
| U | 3987 | 4.2% |
| (space) | 3966 | 4.2% |
| o | 2298 | 2.4% |
| Other values (85) | 11210 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 94810 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| d | 18596 | |
| e | 13287 | |
| i | 10898 | |
| M | 9310 | |
| l | 9309 | |
| p | 7959 | |
| r | 3990 | 4.2% |
| U | 3987 | 4.2% |
| (space) | 3966 | 4.2% |
| o | 2298 | 2.4% |
| Other values (85) | 11210 |
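The word counts point to a small set of intended levels (Low, Middle, Upper Middle, and a level containing the word "rich") alongside several hundred one-off strings such as "yePea" that look corrupted. The sketch below maps raw labels onto an assumed set of valid levels and returns `None` for anything else. The allowed set and the spelling "Rich" are inferred from the sample rows and word table, not stated by the report.

```python
# Assumed valid categories (inferred from this report, not confirmed by it).
VALID_LEVELS = {
    "low": "Low",
    "middle": "Middle",
    "upper middle": "Upper Middle",
    "rich": "Rich",
}

def normalize_income_level(raw):
    """Map a raw label to a canonical level, or None if unrecognized."""
    if raw is None:
        return None
    key = " ".join(str(raw).split()).lower()  # collapse whitespace, ignore case
    return VALID_LEVELS.get(key)

# Labels taken from the sample rows of this report, plus a missing value.
rows = ["Low", "Upper Middle", "Middle", "yePea", "c&8%1", None]
print([normalize_income_level(r) for r in rows])
# ['Low', 'Upper Middle', 'Middle', None, None, None]
```

Mapping unrecognized strings to `None` folds them into the existing missing values instead of keeping 416 single-use categories.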
guests
Real number (ℝ)
Zeros 
| Distinct | 6 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.29292857 |
| Minimum | -2 |
|---|---|
| Maximum | 3 |
| Zeros | 9658 |
| Zeros (%) | 69.0% |
| Negative | 153 |
| Negative (%) | 1.1% |
| Memory size | 109.5 KiB |
Quantile statistics
| Minimum | -2 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 1 |
| 95-th percentile | 1 |
| Maximum | 3 |
| Range | 5 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 0.48916395 |
|---|---|
| Coefficient of variation (CV) | 1.6699086 |
| Kurtosis | -0.27393064 |
| Mean | 0.29292857 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 0.73699744 |
| Sum | 4101 |
| Variance | 0.23928137 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 9658 | |
| 1 | 4123 | |
| -1 | 151 | 1.1% |
| 2 | 65 | 0.5% |
| -2 | 2 | < 0.1% |
| 3 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| -2 | 2 | < 0.1% |
| -1 | 151 | 1.1% |
| 0 | 9658 | |
| 1 | 4123 | |
| 2 | 65 | 0.5% |
| 3 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 3 | 1 | < 0.1% |
| 2 | 65 | 0.5% |
| 1 | 4123 | |
| 0 | 9658 | |
| -1 | 151 | 1.1% |
| -2 | 2 | < 0.1% |
amenities
Categorical
Missing 
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 5997 |
| Missing (%) | 42.8% |
| Memory size | 109.5 KiB |
Length
| Max length | 13 |
|---|---|
| Median length | 8 |
| Mean length | 8.4415844 |
| Min length | 6 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Swimming Pool |
|---|---|
| 2nd row | Swimming Pool |
| 3rd row | Garden |
| 4th row | Fountain |
| 5th row | Swimming Pool |
Common Values
| Value | Count | Frequency (%) |
| Garden | 2627 | 18.8% |
| Swimming Pool | 2086 | 14.9% |
| Fountain | 1648 | 11.8% |
| Jacuzzi | 1642 | 11.7% |
| (Missing) | 5997 | 42.8% |
Words
| Value | Count | Frequency (%) |
| garden | 2627 | |
| swimming | 2086 | |
| pool | 2086 | |
| fountain | 1648 | |
| jacuzzi | 1642 |
Most occurring characters
| Value | Count | Frequency (%) |
| n | 8009 | 11.9% |
| i | 7462 | 11.0% |
| a | 5917 | 8.8% |
| o | 5820 | 8.6% |
| m | 4172 | 6.2% |
| u | 3290 | 4.9% |
| z | 3284 | 4.9% |
| G | 2627 | 3.9% |
| e | 2627 | 3.9% |
| d | 2627 | 3.9% |
| Other values (11) | 21723 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 67558 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| n | 8009 | 11.9% |
| i | 7462 | 11.0% |
| a | 5917 | 8.8% |
| o | 5820 | 8.6% |
| m | 4172 | 6.2% |
| u | 3290 | 4.9% |
| z | 3284 | 4.9% |
| G | 2627 | 3.9% |
| e | 2627 | 3.9% |
| d | 2627 | 3.9% |
| Other values (11) | 21723 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 67558 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| n | 8009 | 11.9% |
| i | 7462 | 11.0% |
| a | 5917 | 8.8% |
| o | 5820 | 8.6% |
| m | 4172 | 6.2% |
| u | 3290 | 4.9% |
| z | 3284 | 4.9% |
| G | 2627 | 3.9% |
| e | 2627 | 3.9% |
| d | 2627 | 3.9% |
| Other values (11) | 21723 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 67558 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| n | 8009 | 11.9% |
| i | 7462 | 11.0% |
| a | 5917 | 8.8% |
| o | 5820 | 8.6% |
| m | 4172 | 6.2% |
| u | 3290 | 4.9% |
| z | 3284 | 4.9% |
| G | 2627 | 3.9% |
| e | 2627 | 3.9% |
| d | 2627 | 3.9% |
| Other values (11) | 21723 |
appliance_usage
Categorical
Missing 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 415 |
| Missing (%) | 3.0% |
| Memory size | 109.5 KiB |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0.0 |
|---|---|
| 2nd row | 1.0 |
| 3rd row | 1.0 |
| 4th row | 0.0 |
| 5th row | 0.0 |
Common Values
| Value | Count | Frequency (%) |
| 0.0 | 10841 | 77.4% |
| 1.0 | 2744 | 19.6% |
| (Missing) | 415 | 3.0% |
Words
| Value | Count | Frequency (%) |
| 0.0 | 10841 | |
| 1.0 | 2744 | 20.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 24426 | |
| . | 13585 | |
| 1 | 2744 | 6.7% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 40755 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 0 | 24426 | |
| . | 13585 | |
| 1 | 2744 | 6.7% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 40755 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 0 | 24426 | |
| . | 13585 | |
| 1 | 2744 | 6.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 40755 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 0 | 24426 | |
| . | 13585 | |
| 1 | 2744 | 6.7% |
water_consumption
Real number (ℝ)
High correlation 
| Distinct | 10635 |
|---|---|
| Distinct (%) | 76.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 164.46123 |
| Minimum | 35.54 |
|---|---|
| Maximum | 531.49 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 109.5 KiB |
Quantile statistics
| Minimum | 35.54 |
|---|---|
| 5-th percentile | 70.6865 |
| Q1 | 109.55 |
| median | 150.375 |
| Q3 | 206.765 |
| 95-th percentile | 304.6915 |
| Maximum | 531.49 |
| Range | 495.95 |
| Interquartile range (IQR) | 97.215 |
Descriptive statistics
| Standard deviation | 72.873894 |
|---|---|
| Coefficient of variation (CV) | 0.44310683 |
| Kurtosis | 0.78575269 |
| Mean | 164.46123 |
| Median Absolute Deviation (MAD) | 46.595 |
| Skewness | 0.91893074 |
| Sum | 2302457.2 |
| Variance | 5310.6044 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 136.19 | 6 | < 0.1% |
| 166.87 | 5 | < 0.1% |
| 138.97 | 5 | < 0.1% |
| 127.11 | 5 | < 0.1% |
| 118.26 | 5 | < 0.1% |
| 153.51 | 5 | < 0.1% |
| 114.51 | 5 | < 0.1% |
| 125.61 | 5 | < 0.1% |
| 144.82 | 5 | < 0.1% |
| 153.6 | 4 | < 0.1% |
| Other values (10625) | 13950 |
| Value | Count | Frequency (%) |
| 35.54 | 1 | |
| 37.8 | 1 | |
| 38.32 | 1 | |
| 40.57 | 1 | |
| 40.97 | 1 | |
| 41.19 | 1 | |
| 41.59 | 1 | |
| 41.73 | 1 | |
| 42.25 | 1 | |
| 42.3 | 1 |
| Value | Count | Frequency (%) |
| 531.49 | 1 | |
| 523.29 | 1 | |
| 504.3 | 1 | |
| 494.24 | 1 | |
| 491.98 | 1 | |
| 485.7 | 1 | |
| 483.29 | 1 | |
| 476.88 | 1 | |
| 475.21 | 1 | |
| 471.27 | 1 |
quarter
Categorical
High correlation 
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 109.5 KiB |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1 |
|---|---|
| 2nd row | 1 |
| 3rd row | 1 |
| 4th row | 1 |
| 5th row | 1 |
Common Values
| Value | Count | Frequency (%) |
| 3 | 3588 | |
| 2 | 3549 | |
| 1 | 3519 | |
| 4 | 3344 |
Words
| Value | Count | Frequency (%) |
| 3 | 3588 | |
| 2 | 3549 | |
| 1 | 3519 | |
| 4 | 3344 |
Most occurring characters
| Value | Count | Frequency (%) |
| 3 | 3588 | |
| 2 | 3549 | |
| 1 | 3519 | |
| 4 | 3344 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 14000 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 3 | 3588 | |
| 2 | 3549 | |
| 1 | 3519 | |
| 4 | 3344 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 14000 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 3 | 3588 | |
| 2 | 3549 | |
| 1 | 3519 | |
| 4 | 3344 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 14000 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 3 | 3588 | |
| 2 | 3549 | |
| 1 | 3519 | |
| 4 | 3344 |
is_weekend
Categorical
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 10002 | 71.4% |
| 1 | 3998 | 28.6% |
Words
| Value | Count | Frequency (%) |
| 0 | 10002 | |
| 1 | 3998 | 28.6% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 10002 | |
| 1 | 3998 | 28.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 14000 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 0 | 10002 | |
| 1 | 3998 | 28.6% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 14000 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 0 | 10002 | |
| 1 | 3998 | 28.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 14000 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 0 | 10002 | |
| 1 | 3998 | 28.6% |
year
Real number (ℝ)
| Distinct | 13 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2007.8954 |
| Minimum | 2002 |
|---|---|
| Maximum | 2014 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 54.8 KiB |
Quantile statistics
| Minimum | 2002 |
|---|---|
| 5-th percentile | 2002 |
| Q1 | 2005 |
| median | 2008 |
| Q3 | 2011 |
| 95-th percentile | 2014 |
| Maximum | 2014 |
| Range | 12 |
| Interquartile range (IQR) | 6 |
Descriptive statistics
| Standard deviation | 3.6884231 |
|---|---|
| Coefficient of variation (CV) | 0.0018369598 |
| Kurtosis | -1.2020733 |
| Mean | 2007.8954 |
| Median Absolute Deviation (MAD) | 3 |
| Skewness | 0.010049824 |
| Sum | 28110536 |
| Variance | 13.604465 |
| Monotonicity | Increasing |
| Value | Count | Frequency (%) |
| 2004 | 1098 | 7.8% |
| 2008 | 1098 | 7.8% |
| 2012 | 1098 | 7.8% |
| 2002 | 1095 | 7.8% |
| 2003 | 1095 | 7.8% |
| 2005 | 1095 | 7.8% |
| 2006 | 1095 | 7.8% |
| 2007 | 1095 | 7.8% |
| 2009 | 1095 | 7.8% |
| 2010 | 1095 | 7.8% |
| Other values (3) | 3041 |
| Value | Count | Frequency (%) |
| 2002 | 1095 | |
| 2003 | 1095 | |
| 2004 | 1098 | |
| 2005 | 1095 | |
| 2006 | 1095 | |
| 2007 | 1095 | |
| 2008 | 1098 | |
| 2009 | 1095 | |
| 2010 | 1095 | |
| 2011 | 1095 |
| Value | Count | Frequency (%) |
| 2014 | 851 | |
| 2013 | 1095 | |
| 2012 | 1098 | |
| 2011 | 1095 | |
| 2010 | 1095 | |
| 2009 | 1095 | |
| 2008 | 1098 | |
| 2007 | 1095 | |
| 2006 | 1095 | |
| 2005 | 1095 |
month
Real number (ℝ)
High correlation 
| Distinct | 12 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 6.4428571 |
| Minimum | 1 |
|---|---|
| Maximum | 12 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 54.8 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 3 |
| median | 6 |
| Q3 | 9 |
| 95-th percentile | 12 |
| Maximum | 12 |
| Range | 11 |
| Interquartile range (IQR) | 6 |
Descriptive statistics
| Standard deviation | 3.4225721 |
|---|---|
| Coefficient of variation (CV) | 0.53121962 |
| Kurtosis | -1.1892672 |
| Mean | 6.4428571 |
| Median Absolute Deviation (MAD) | 3 |
| Skewness | 0.014464841 |
| Sum | 90200 |
| Variance | 11.714 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 1209 | |
| 3 | 1209 | |
| 5 | 1209 | |
| 7 | 1209 | |
| 8 | 1209 | |
| 4 | 1170 | |
| 6 | 1170 | |
| 9 | 1170 | |
| 10 | 1148 | |
| 12 | 1116 | |
| Other values (2) | 2181 |
| Value | Count | Frequency (%) |
| 1 | 1209 | |
| 2 | 1101 | |
| 3 | 1209 | |
| 4 | 1170 | |
| 5 | 1209 | |
| 6 | 1170 | |
| 7 | 1209 | |
| 8 | 1209 | |
| 9 | 1170 | |
| 10 | 1148 |
| Value | Count | Frequency (%) |
| 12 | 1116 | |
| 11 | 1080 | |
| 10 | 1148 | |
| 9 | 1170 | |
| 8 | 1209 | |
| 7 | 1209 | |
| 6 | 1170 | |
| 5 | 1209 | |
| 4 | 1170 | |
| 3 | 1209 |
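The year, month and is_weekend counts can all be reproduced from the timestamp range alone, which indicates that these columns are calendar features derived from timestamp. The sketch assumes an 8-hour cadence (read off the sample rows) and that is_weekend marks Saturday and Sunday; neither is stated by the report.

```python
from collections import Counter
from datetime import datetime, timedelta

start = datetime(2002, 1, 1)  # reported minimum timestamp
step = timedelta(hours=8)     # assumed cadence, read off the sample rows
stamps = [start + i * step for i in range(14000)]

per_year = Counter(ts.year for ts in stamps)
per_month = Counter(ts.month for ts in stamps)
# Assumption: is_weekend marks Saturday and Sunday (weekday() 5 and 6).
weekend = sum(1 for ts in stamps if ts.weekday() >= 5)

print(per_year[2004], per_year[2013], per_year[2014])  # 1098 1095 851
print(per_month[2], per_month[10], per_month[11])      # 1101 1148 1080
print(weekend)                                         # 3998
```

Leap years contribute 1098 records (366 days at three per day), full non-leap years 1095, and the partial final year 851, as in the year tables. The weekend total equals the count of 1 values in is_weekend.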
Interactions
Correlations
| | amenities | apartment_type | appliance_usage | guests | is_weekend | month | period_consumption_index | quarter | residents | temperature | water_consumption | water_price | year |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| amenities | 1.000 | 0.192 | 0.011 | 0.010 | 0.000 | 0.000 | 0.000 | 0.000 | 0.011 | 0.000 | 0.254 | 0.003 | 0.000 |
| apartment_type | 0.192 | 1.000 | 0.013 | 0.000 | 0.000 | 0.000 | 0.012 | 0.000 | 0.000 | 0.000 | 0.371 | 0.000 | 0.011 |
| appliance_usage | 0.011 | 0.013 | 1.000 | 0.014 | 0.009 | 0.014 | 0.026 | 0.000 | 0.000 | 0.018 | 0.125 | 0.000 | 0.000 |
| guests | 0.010 | 0.000 | 0.014 | 1.000 | 0.000 | -0.004 | 0.010 | 0.000 | 0.010 | 0.009 | 0.186 | 0.004 | -0.007 |
| is_weekend | 0.000 | 0.000 | 0.009 | 0.000 | 1.000 | 0.000 | 0.000 | 0.000 | 0.005 | 0.019 | 0.000 | 0.030 | 0.000 |
| month | 0.000 | 0.000 | 0.014 | -0.004 | 0.000 | 1.000 | 0.004 | 1.000 | -0.010 | -0.011 | -0.002 | -0.002 | -0.038 |
| period_consumption_index | 0.000 | 0.012 | 0.026 | 0.010 | 0.000 | 0.004 | 1.000 | 0.000 | 0.001 | -0.027 | 0.371 | 0.003 | -0.003 |
| quarter | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 1.000 | 0.000 | 1.000 | 0.000 | 0.017 | 0.000 | 0.000 | 0.036 |
| residents | 0.011 | 0.000 | 0.000 | 0.010 | 0.005 | -0.010 | 0.001 | 0.000 | 1.000 | 0.000 | 0.725 | 0.428 | -0.002 |
| temperature | 0.000 | 0.000 | 0.018 | 0.009 | 0.019 | -0.011 | -0.027 | 0.017 | 0.000 | 1.000 | 0.148 | 0.005 | 0.008 |
| water_consumption | 0.254 | 0.371 | 0.125 | 0.186 | 0.000 | -0.002 | 0.371 | 0.000 | 0.725 | 0.148 | 1.000 | 0.466 | 0.009 |
| water_price | 0.003 | 0.000 | 0.000 | 0.004 | 0.030 | -0.002 | 0.003 | 0.000 | 0.428 | 0.005 | 0.466 | 1.000 | 0.003 |
| year | 0.000 | 0.011 | 0.000 | -0.007 | 0.000 | -0.038 | -0.003 | 0.036 | -0.002 | 0.008 | 0.009 | 0.003 | 1.000 |
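The 1.000 association between month and quarter is expected if quarter follows the standard calendar definition, because it is then a deterministic function of month. That definition is an assumption here, though the sample rows are consistent with it (month 1 falls in quarter 1, month 10 in quarter 4). A minimal sketch:

```python
def quarter_of(month):
    """Calendar quarter (1-4) for a month number (1-12)."""
    if not 1 <= month <= 12:
        raise ValueError(f"month out of range: {month}")
    return (month - 1) // 3 + 1

mapping = {m: quarter_of(m) for m in range(1, 13)}
print(mapping)
```

Because one column fully determines the other, keeping both adds no information for modeling; which one to keep is a design choice that depends on the granularity needed.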
Missing values
Sample
First rows
| | timestamp | residents | apartment_type | temperature | humidity | water_price | period_consumption_index | income_level | guests | amenities | appliance_usage | water_consumption | quarter | is_weekend | year | month |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 2002-01-01 00:00:00 | 1 | Studio | 15.31 | 46.61 | 1.06 | 0.97 | Low | 0 | Swimming Pool | 0.0 | 64.85 | 1 | 0 | 2002 | 1 |
| 1 | 2002-01-01 08:00:00 | 4 | NaN | 21.01 | 66.11 | 2.98 | 0.91 | Upper Middle | 1 | Swimming Pool | 1.0 | 192.50 | 1 | 0 | 2002 | 1 |
| 2 | 2002-01-01 16:00:00 | 2 | Cottage | 12.86 | 60.86 | 1.44 | 1.43 | Middle | 0 | NaN | 1.0 | 116.62 | 1 | 0 | 2002 | 1 |
| 3 | 2002-01-02 00:00:00 | 2 | 1BHK | 20.16 | 50.58 | 1.48 | 0.91 | Middle | -1 | Garden | 0.0 | 76.96 | 1 | 0 | 2002 | 1 |
| 4 | 2002-01-02 08:00:00 | 2 | Cottage | 16.23 | 52.25 | 1.14 | 1.11 | Middle | 0 | Fountain | 0.0 | 104.70 | 1 | 0 | 2002 | 1 |
| 5 | 2002-01-02 16:00:00 | 4 | 2BHK | 22.23 | 53.86 | 1.15 | 1.46 | Middle | 0 | NaN | 1.0 | 218.23 | 1 | 0 | 2002 | 1 |
| 6 | 2002-01-03 00:00:00 | 3 | 2BHK | 10.83 | 57.51 | 2.98 | 1.07 | Upper Middle | 0 | Swimming Pool | 0.0 | 135.80 | 1 | 0 | 2002 | 1 |
| 7 | 2002-01-03 08:00:00 | 3 | Cottage | 30.37 | 33.88 | 1.35 | 1.40 | yePea | 0 | Fountain | 0.0 | 202.29 | 1 | 0 | 2002 | 1 |
| 8 | 2002-01-03 16:00:00 | 4 | Bungalow | 16.57 | 57.94 | 2.84 | 1.47 | Upper Middle | 0 | Garden | 0.0 | 188.04 | 1 | 0 | 2002 | 1 |
| 9 | 2002-01-04 00:00:00 | 2 | NaN | 22.59 | 57.25 | 1.11 | 0.99 | Low | 1 | NaN | 1.0 | 88.94 | 1 | 0 | 2002 | 1 |
Last rows
| | timestamp | residents | apartment_type | temperature | humidity | water_price | period_consumption_index | income_level | guests | amenities | appliance_usage | water_consumption | quarter | is_weekend | year | month |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 13990 | 2014-10-08 08:00:00 | 2 | 1BHK | 22.37 | 62.61 | 1.22 | 1.290000 | Middle | 0 | NaN | 0.0 | 98.96 | 4 | 0 | 2014 | 10 |
| 13991 | 2014-10-08 16:00:00 | 3 | 3BHK | 23.99 | 46.91 | 2.63 | 1.150000 | Upper Middle | 0 | NaN | 0.0 | 149.44 | 4 | 0 | 2014 | 10 |
| 13992 | 2014-10-09 00:00:00 | 4 | Bungalow | 15.22 | 48.33 | 2.45 | 0.890000 | Upper Middle | 0 | NaN | 0.0 | 117.03 | 4 | 0 | 2014 | 10 |
| 13993 | 2014-10-09 08:00:00 | 3 | 1BHK | 24.27 | 53.0 | 1.00 | 1.500000 | Middle | 1 | NaN | 1.0 | 213.07 | 4 | 0 | 2014 | 10 |
| 13994 | 2014-10-09 16:00:00 | 2 | 1BHK | 26.52 | 47.19 | 1.47 | 2.082106 | Low | 0 | NaN | 0.0 | 111.68 | 4 | 0 | 2014 | 10 |
| 13995 | 2014-10-10 00:00:00 | 2 | 1BHK | 25.61 | 61.5 | 1.70 | 0.940000 | Low | 0 | NaN | 0.0 | 78.59 | 4 | 0 | 2014 | 10 |
| 13996 | 2014-10-10 08:00:00 | 5 | 2BHK | 13.27 | 52.58 | 1.88 | 1.030000 | Upper Middle | 0 | Garden | 1.0 | 185.50 | 4 | 0 | 2014 | 10 |
| 13997 | 2014-10-10 16:00:00 | 4 | 2BHK | NaN | 46.93 | 1.22 | 1.100000 | Middle | 0 | NaN | 1.0 | 180.28 | 4 | 0 | 2014 | 10 |
| 13998 | 2014-10-11 00:00:00 | 4 | 3BHK | 11.62 | 64.48 | 2.86 | 1.120000 | Upper Middle | 1 | Swimming Pool | 0.0 | 212.19 | 4 | 1 | 2014 | 10 |
| 13999 | 2014-10-11 08:00:00 | 4 | 2BHK | 23.78 | 44.88 | 1.26 | 2.133695 | c&8%1 | 1 | Jacuzzi | 0.0 | 303.59 | 4 | 1 | 2014 | 10 |